Reading order detection on handwritten documents

نویسندگان

چکیده

Abstract Recent advances in Handwritten Text Recognition and Document Layout Analysis have made it possible to convert digital images of manuscripts into electronic text. However, providing this text with the correct structure context is still an open problem that needs be solved actually enable extracting relevant information conveyed by The most important needed for a set elements their reading order. Most studies on order are rule-based approaches focus printed documents. Much less attention has been paid so far handwritten documents, where becomes particularly important—and challenging. In work, we propose new approach automatically determine regions lines task approached as sorting order-relation operator learned from examples. We experimentally demonstrate effectiveness our method three different datasets at hierarchical levels.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Annoflow - Handwritten Annotation and Proof- reading on Dynamic Digital Documents

Phil Crosby Michael Quinn François Guimbretière Department of Computer Science Human-Computer Interaction Lab University of Maryland, College Park, MD, 20742 [email protected] [email protected] [email protected] ABSTRACT Proof-reading digital documents is a difficult task, because the ink annotations made on documents do not maintain their relevance as the document changes. In addition, applyi...

متن کامل

Repudiation Detection in Handwritten Documents

Forensic document verification presents a different and interesting set of challenges as opposed to traditional writer identification and verification tasks using natural handwriting. The handwritten data presented to a forensic examiner is often deliberately altered, in addition to being limited in quantity. Specifically, the alterations can be either forged, where one imitates another person’...

متن کامل

Text line detection in handwritten documents

Article history: Received 13 April 2007 Received in revised form 26 March 2008

متن کامل

Connected Component Based Word Spotting on Persian Handwritten image documents

Word spotting is to make searchable unindexed image documents by locating word/words in a doc-ument image, given a query word. This problem is challenging, mainly due to the large numberof word classes with very small inter-class and substantial intra-class distances. In this paper, asegmentation-based word spotting method is presented for multi-writer Persian handwritten doc-...

متن کامل

On Segmentation Methods for Handwritten Arabic Documents

In the literature, two methods for the extraction zones of the document are more used. The first method is based on the Mathematical Morphology (MM). The second is based on Hough Transform (HT). The main contribution of this paper is the application of these methods to extract the handwritten components of the complex document. The second contribution is the combination between the HT and the M...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neural Computing and Applications

سال: 2022

ISSN: ['0941-0643', '1433-3058']

DOI: https://doi.org/10.1007/s00521-022-06948-5